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Abstract 

Dzhafarov, Zhang, and Kujala (Phil. Trans. Roy. Soc. A 374, 
20150099) reviewed several behavioral data sets imitating the formal de¬ 
sign of the quantum-mechanical contextuality experiments. The con¬ 
clusion was that none of these data sets exhibited contextuality if un¬ 
derstood in the generalized sense proposed in Dzhafarov, Kujala, and 
Larsson (Found. Pliys. 7, 762-782, 2015), while the traditional defini¬ 
tion of contextuality does not apply to these data because they violate 
the condition of consistent connectedness (also known as marginal se¬ 
lectivity, no-signaling condition, no-disturbance principle, etc.). In this 
paper we clarify the relationship between (in)consistent connectedness 
and (non)contextuality, as well as between the traditional and extended 
definitions of (non)contextuality, using as an example the Clauser-Horn- 
Shimony-Holt (CHSH) inequalities originally designed for detecting con¬ 
textuality in entangled particles. 


1 Introduction 

This paper is based on two talks given at the conference Quantum Theory: From 
Foundations to Technologies organized by Andrei Khrennikov at the Linnaeus 
University in Vaxjo, Sweden. The content of these talks has been to a large 
extent published elsewhere m, and the present paper focuses on one specific 
aspect of these talks: the relationship between (in)consistent connectedness and 
(non) contextuality. This focus was prompted by a recent extensive exchange 
of personal communications involving a few of our colleagues and related to 
a new experiment announced by Aerts and Sozzo [Bj. The issue in question 
is by no means new: it was in fact raised and discussed in Ref. (7], using as 
an example an experiment by Aerts, Gabora, and Sozzo 0. Later, this issue 
has become a central one in the development of our approach to contextuality, 
called Contextuality-by-Default (CbD) [TU5lf!7i 1TV| . It has become clear from the 
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discussion in question, however, that there are still serious disagreements about 
this issue. The aim of this paper is to offer a resolution for these disagreements 
and to dispel possible conceptual confusions. 

Although prompted by a discussion of Ref. [6], this paper is not meant to 
be a critique of that or any particular paper. We use the experiment presented 
in Ref. [B] and the paradigm in which it was conducted only as an example, 
one providing an opportunity to demonstrate the workings of our theory of 
contextuality and to make our points. We would like therefore to play down the 
critical aspects of this paper. 

2 A list of important terms and notation conven¬ 
tions 

Special terms used in this paper are rigorously defined and every notation con¬ 
vention is stipulated. The reader may, however, find it useful to consult the 
following list from time to time to recall a term or to more easily find where it 
is systematically discussed. 

Measurements (random variables). The generic notation for random vari¬ 
ables is R q , interpreted as the measurement of property q in context c. If c can 
be ci,..., Cm and q can be qi,...,q n , then instead of Rql we write R( (see 
Section [3J. 

In the “Alice-Bob” variant of cyclic-4 systems (Sections [3] and |8|, we re¬ 
place the general notation R c q for the measurements by A-B notation, with the 
following table of correspondences: 

An B n A 12 B 12 A 21 B 21 A-22 B 22 /1 \ 

p 1 pi p4 p4 p2 p2 p3 p3 * \-V 

_/ti -/^2 XX ^ XX4 1X3 XX2 XX 3 XX4 

The logic of these correspondences is explained in Section [51 (l25l) and (1261) . 

Bunches and connections. In a set of measurements: a subset of all R' q 
with the same c and different qs forms a bunch of measurements representing 
context c; a subset of all R c q with the same q and different cs forms a connection 
of measurements representing property q (Section [5]). 

Consistent connectedness. Some connections have a certain property, ©, 
that makes them consistent ; and a system of measurements with all its con¬ 
nections consistent is consistently connected. (See Sections [3] and El) The term 
is close to such terms as no-signaling, no-disturbance, etc., but void of their 
physical connotations. 

Contextual and noncontextual systems. (Non)contextuality of a consis¬ 
tently connected system of measurements is defined in Section [BJ Definition 16.41 
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The general definition of (non)contextuality, for arbitrary systems, is given in 
Section [7] Definition 17.II 

Couplings: S'-notation and '/'-notation. The notion of (non)contextuality 
is based on the notion of a (probabilistic) coupling. Definition 16.11 defines the 
couplings for cyclic-4 systems, and the subsequent remarks explain how the 
definition applies to arbitrarily-connected systems of random variables. 

When a coupling is constructed for all measurements in a cyclic-4 system, 
then for each R q in this system we denote its counterpart in the coupling by S q 
(Definition 16.11) . 

When a coupling is constructed for only a part of a cyclic-4 system, specif¬ 
ically for pairs of the measurements R q ,R q forming a connection, then the 
corresponding elements of the coupling are denoted T q ,T q (Definition 16.2D . 


3 Systems of measurements 

Of the two concepts characterizing a system of measurements, (in)consistent 
connectedness and (non)contextuality, the former is about distributions of the 
measurements of one and the same property in different contexts, whereas the 
latter is about the (im)possibility of imposing certain joint distributions on all 
the measurements, for all properties and all contexts involved. 

Let a property q be measured in contexts c and d. These measurements 
can be denoted R c q and R q . The property q may be a spin in a given particle 
along a given axis, and the contexts c and d may be defined by what other spins 
(say, in other particles) are measured together with this one. Outside physics, 
the property q may be a question, and the contexts c and c' may be defined by 
whether this question was asked first or following another question. Examples 
can be easily multiplied, within physics and without. The measurements R q 
and R q are two different random variables, and their distributions can be the 
same or different. If they are the same, we write 

R c q ~R c q , (2) 

and if this distributional equality holds for any q and any contexts c, c' in which q 
is measured, we say that the system of measurements is consistently connected. 
This term derives from the term connection that we use to denote a set of 
all measurements of a given property in all contexts in which it is measured. 
For instance, if property q is measured in three contexts, c,c!, c". and in no 
other contexts, then the set | R q ,R q ,R q j is the connection for q. Consistent 
connectedness is known under many different names: no-signaling condition m , 
marginal selectivity 0[14j(15], no-disturbance principle m , etc. (see Ref. Em 
for a few other terms). 

Contextuality is about all measurements R q composing a system. Such a 
system can always be viewed as a set of bunches, where a bunch is defined 
as the set of all measurements made within a given context. For instance, 
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let q,q',q" be measured in a context c (e.g., q,q',q" are three spins measured 
simultaneously, or three questions asked of one and the same person), and let 
no other properties be measured in that context. Then the set {R q , R q ,, 7?“,,} is 
the bunch (of measurements) representing the context c. The random variables 
within a bunch are jointly distributed, which means that they can be viewed as 
a single (“vector-valued”) random variable. 

Assuming the numbers of the properties and the contexts are finite, one can 
present the system of measurements in the form of a matrix, in which rows cor¬ 
respond to the properties {qi ,..., q n } and columns to the contexts {ci,..., c m }, 
and each cell (i,j) is filled with the measurement R\ if q t is measured in context 
Cj (and is left empty otherwise). 


R\ 

bunch representing Cj 

The random variables in any row of this matrix form a connection for the cor¬ 
responding property, and those in any column form a bunch representing the 
corresponding context. 

We will focus in this paper on a special system of measurements, a cyclic sys¬ 
tem of rank 4 TOM], or cyclic-4 system for short. Its best known implementation 
is the “Alice-Bob” version of the Einstein-Podolsky-Rosen-Bolnn system (EPR- 
B, where B can also stand for Bell). This system has been prominently studied 
in relation to contextuality since John Bell’s pioneering work jTHJQlI], although 
the conceptual framework used in quantum physics (entanglement, nonlocality) 
initially did not include contextuality explicitly. Outside quantum physics the 
term “nonlocality” rarely makes sense, and “entanglement”, if it does make sense 
(even if only metaphorical), can always be taken as a possible “explanation” for 
contextuality, if observed. We will return to the EPR-B implementation of the 
cyclic-4 system in Section [8j 

The system in question can be presented in the format of the matrix ([3]) as 
follows: 



Cl 

C2 

C3 

C 4 


<71 

R\ 



R\ 


<72 

R\ 

R\ 



(4) 

93 


Rl 

Rl 



94 



Rl 

R\ 



We will change the notation later, in Section [8] to conform with the traditional 
interpretation of the properties and contexts involved, e.g., in a pair of two 
entangled particles. For now, one can think of any four properties measured 
in any four contexts so that (a) each context contains two properties measured 
together; (b) each property is measured in two different contexts; (c) no two 
contexts share more than one property; and (d) each measurement is a binary 
random variable, with values ±1. 


connection for qi 


(3) 
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4 Traditional understanding of contextuality in 
cyclic-4 systems 

The traditional understanding of contextuality in the cyclic-4 paradigm can be 
presented as follows. Let us assume the measurements R c q ,R c q of any property 
q in the contexts c, d in which it is measured to be in fact one and the same 
random variable, R*. This assumption can be referred to as that of context- 
irrelevance , and in many traditional treatments it is made implicitly, by the 
virtue of indexing the measurements by the properties being measured but not 
by the contexts. The assumption implies that our matrix (jTj can be written as 


91 

Cl 

Rl 

C2 

C3 

c 4 

Rl 

92 

TD* 

rL 2 

TD* 

K 2 



93 


R-i 

r* 3 


94 



XL 4 

TD* 

XL 4 


immediately and trivially implying consistent connectedness: e.g., R\ measured 
together with R 3 in context C\ is precisely the same random variable as R\ 
measured together with R\ in context C 4 (otherwise they could not be denoted 
by the same symbol R\)\ and, of course, a fixed random variable has a fixed 
distribution. 

It is easy to show that if random variables are understood within the frame¬ 
work of the classical, Kolmogorovian probability theory (KPT), then the four 
random variables {i?2, j -R 3 , -R 4 } in system J5]) possess a joint distribution. In¬ 
deed, the random variables (I?*, R 2 ) in context Ci are jointly distributed, which 
means that they are two measurable functions defined on the same probability 
space 5, 

R* : S {—1, +1} , i ?2 : S' —» { — 1, +1} • ( 6 ) 

[More precisely, S' is a set in a probability space (S, £,/it), where £ is a sigma- 
algebra (set of events) on S and fx some probability measure. A function X : 
S —> {—1,+1} is measurable (and therefore A is a random variable) if the set 
of values mapped into +1 is an event (i.e., it belongs to £, and therefore has 
a well-defined probability value). We conveniently confuse the set S with the 
probability space containing S.] 

The random variables (f?*,I?2) in context C 4 are also jointly distributed, 
whence R\,R\ are measurable functions on the same probability space. This 
must be the same space S as above because the variable R\ in the contexts ci 
and C 4 is the same. Hence 


R* 4 :S^{- 1,+1}. (7) 

Finally, in context C 2 , the random variables (R%, R 3 ) are jointly distributed, and 
we conclude that 


R 3 ■ S —> {— 1 , + 1 } • 


( 8 ) 
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As a result, all four random variables in <SD are measurable functions defined 
on the same probability space, i.e., they are jointly distributed. 

[There is a naive way of arriving at the same conclusion, by assuming that 
in the KPT any set of random variables is jointly distributed. This view is 
untenable Eunj-i 

Now, the joint distribution of {Rl, R 2 , R 3 , Rl} is unobservable, because 
no two measurements made in two different contexts (such as {RjyR^} or 
{R 2 ,R|}), “co-occur” in any empirical meaning of “co-occurrence”. One can 
only observe (i.e., estimate from observed frequencies of co-occurrences) the 
distributions of four specific subsets of {i?J, R 2 , Rg,R|}, the pairs of random 
variables forming the columns of matrix (fojl. We have the following theorem 
about these pairs that was first proved, mutatis mutandis, in Ref. m■ In its 
formulation, (•) denotes expected value, and the maximum is taken over all 
combinations of + and — signs replacing ± so that the number of the — signs 
is odd (1 or 3). 

Theorem 4.1. In any system described by (Mh 

max (± (RjR 2 *) ± (R* 2 R* 3 ) ± (R* 3 R* 4 ) ± ( R\R *» < 2. (9) 

odd number 
of-' s 

The inequality ([21) is usually presented as a necessary condition for the ex¬ 
istence of a joint distribution of {RJ, R 2 , R 3 , R|}, implying that J2]) can be 
violated, in which case {Rl, R 2 , R 3 , R^} do not have a joint distribution and we 
say that the cyclic-4 system is contextual. This understanding, however, lacks 
logical rigor. If the left-hand side of ([9]) can be computed at all, then the ex¬ 
pected products (R*[R 2 ),..., {R 4 R 4 ) in it are well-defined, whence each of the 
corresponding pairs (Rjy R 2 ),..., {R%, Rl) has a well-defined joint distribution. 
But then, as we have seen, the entire set {R^, R 2 , Rg, R|} has to have a joint 
distribution too, and then, by Theorem 14. II ([9]) must hold. It simply cannot be 
violated. 

Put differently but equivalently, if {Rg, R 2 , Rg, R|} do not possess a joint 
distribution, then at least two of the four pairs (R*, R 2 ),..., (RjJ, R^) forming 
columns of matrix ([5]) do not have joint distributions (because a global joint 
distribution follows from any three of these pairs being jointly distributed). But 
if this is the case, the left-hand side of ([9| simply cannot be computed. 

5 Consistent connectedness and contextuality in 
traditional understanding 

One can be easily confused by the reasoning above, because it may seem that 
it is trivial to construct a system ([5]) in which all four expected products 
(R 4 R 2 ),..., (R^Rl) are well-defined while fl 2 [) is violated (and it is routinely 
claimed that quantum mechanics predicts such situations and experiments con¬ 
firm these predictions). This seemingly trivial possibility, however, is merely 
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Table 1: Example in which the left-hand side of (0 is seemingly well-defined 
and exceeds 2. Since this is mathematically impossible, there should be a hidden 
assumption here that is false. 

iZJ 



+1 

-1 


+1 

i/2 

0 

i/2 

-1 

0 

i/2 

i/2 


i/2 

i/2 


+ 1 

-1 

+1 

i/2 

0 

i/2 

-1 

0 

i/2 

i/2 


i/2 

i/2 




+ 1 

-1 


+ 1 

i/2 

0 

i/2 

-1 

0 

i/2 

i/2 


i/2 

i/2 


+ 1 

-1 

+ 1 

0 

i/2 

i/2 

-1 

i/2 

0 

i/2 


i/2 

i/2 



an illusion, because such a construction would be one of a mathematically self¬ 
contradictory system. One example is given by the four distributions in Table [l] 
where entries within the 2x2 interiors are joint probabilities, while the margins 
show marginal probabilities. The expected products here are 

= <^3> = - (RtK) = (KRt) = i, 

and the left-hand side of © is 4, violating the inequality. As this system is 
mathematically impossible, we must have made an assumption that this con¬ 
tradiction demonstrates to be false. 

What might this assumption be? Can it be that i?* are i ?2 are not jointly 
distributed, or that they are not well-defined random variables? The answer 
is clearly negative: R\ are R% are observed empirically and jointly. The same 
reasoning applies to the remaining three pairs: in each pair the two random vari¬ 
ables are well-defined and jointly distributed. The only possible error therefore 
is in the identity of these random variables across different contexts: we have 
assumed, e.g., that Rt, measured together with R% is the same random variable 
as measured together with R\. It must be wrong to label the measurements 
by the measured properties only, ignoring the contexts. 

This means that a correct initial representation of the system would be as 
in Table [2] with the random variables contextually labeled, so that the pairs 
of measurements forming different bunches do not overlap. If one makes the 
assumption that, for any property q and any two contexts c, c ', the measure¬ 
ments R c q and R c q in this matrix are “one and the same variable” R*, then this 
assumption is rejected by reductio ad absurdum: if it were correct, © would 
have to hold, and it does not. 

Below we will present a rigorous way of formulating the hypothesis that 
random variables measuring the same property in different contexts are (in 
some sense) “the same”. We already have, however, sufficient clarity about 
this hypothesis to address the often misunderstood question of the relationship, 
within the framework of this hypothesis, between the concepts of consistent 
connectedness and contextuality. 


































Table 2: Example of Table Q] without the assumption of context-irrelevance: 
the identity of measurements (random variables) depends not only on what is 
measured but also on the context in which it is measured. 


R\ 


R 


4 

4 



+1 

-1 


+1 

i/2 

0 

i/2 

-1 

0 

i/2 

i/2 


i/2 

i/2 


+ 1 

-1 

+1 

i/2 

0 

i/2 

-1 

0 

i/2 

i/2 


i/2 

i/2 



Rl 



+ 1 

-1 


+ 1 

i/2 

0 

i/2 

-1 

0 

i/2 

i/2 


i/2 

i/2 


+ 1 

-1 

+ 1 

0 

i/2 

i/2 

-1 

i/2 

0 

i/2 


i/2 

i/2 



R\ 


R 


4 

1 


Rl 


It is clear that the assumption of consistent connectedness can be formulated 
and, in special cases, even justified without assuming context-irrelevance. Its 
formulation for the cyclic-4 system presented in the form fij is 


p4 _ pi pi p2 
lx l I 5 1X 2 1X 2 > 

R 2 3 ~Rl Rl~Ri 


( 10 ) 


Such a hypothesis can often be entertained without assuming that the identi¬ 
cally distributed random variables are “the same”. For instance, in the classical 
entanglement paradigm for two electrons, property 1 corresponds to Alice’s 
choice of a certain axis in her particle, and the context Ci is defined by Bob’s 
simultaneously choosing axis 2 in his particle, while the context C 4 is defined 
by Bob’s simultaneously choosing axis 4 (on labeling Alice’s two axes 1,3, and 
Bob’s two axes 2,4). If the two particles are space-like separated, one should 
assume that Bob’s settings cannot influence Alice’s measurements, which im¬ 
plies the distribution of R\ is the same as the distribution of Rf. No physical 
principle prevents one, however, from viewing R\ and Rf as different random 
variables with one and the same distribution. We have seen already that one’s 
denying this view leads to a mathematical contradiction. 

The expected products in ([9j also can be written without regard to the 
context-irrelevance hypothesis. One can replace (R^R^) with (R\R\), (R^R^) 
with (l? 2 -R§), etc. to obtain the following analogue of inequality 

max (± (R{Rf) ± (R 2 2 Rl) ± {R\R\) ± (RfRj)) < 2. ( 11 ) 

odd number 
of-' s 


Using this formulation, Theorem l4.1l can be understood as stating that (HID holds 
under the context-irrelevance hypothesis. If this hypothesis does not hold, m 
does not have to be satisfied and therefore cannot be derived as a theorem. One 
can always check whether it holds or not, but the outcome has no known to 
us interpretation if R q and R q are not assumed always to be the same. Now, 
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the context-irrelevance hypothesis simply cannot be entertained if consistent 
connectedness is violated: “one and the same” random variable cannot have two 
different distributions. There is therefore no reason for checking the inequality 
(fill) if (flOll does not hold. 

The only exception can be made in an imaginary situation wherein the con¬ 
sistency of connectedness is not known (e.g., it is not established in a statistically 
reliable way), but one knows (in a statistically reliable way) that the inequality 
m is violated. In this case one can reject the context-irrelevance hypothesis by 
the following reasoning: (i) if the system is consistently connected, then the hy¬ 
pothesis of context-irrelevance leads to m, which is rejected; (ii) if the system 
is not consistently connected, the hypothesis of context-irrelevance is rejected 
as well; (iii) hence this hypothesis is rejected. 


6 Traditional understanding of contextuality trans¬ 
lated into the CbD language 


In accordance with the CbD approach, R q and R q (c ^ c') are a priori different 
random variables, and since they are never observed “together” (in any empiri¬ 
cally grounded sense of this word), they do not posses a joint distribution. The 
conceptual coherence and advantages offered by this understanding of random 
variables recorded in different contexts has been discussed in Refs. |TI 1H1121| . In 
the framework of KPT this means that R c q and R q are functions defined on two 
different probability spaces: 

Rq '■ S c ~t {—1,+1}, Rq : S c i —> {—1,+1}. (12) 


It is therefore impossible to hypothesize that R q and R c q (c ^ c') are in fact 
“the same”. Nor is it possible to treat these R q and R q as “different but always 
equal to each other”, 


Pr 



= 1 , 


(13) 


since this statement also implies a joint distribution of (R. q ,R q j, translating 
into S c = S c >. 

To formulate the analogue of the context-irrelevance hypothesis within the 
framework of CbD one has to use the foundational notion of a (probabilistic) 
coupling. 


Definition 6.1. A coupling for the cyclic-4 system (|D) is a set of eight jointly 
distributed random variables 


such that 


( ql ql o2 c2 c3 c3 o4 q4\ 

) J 2 i J 2 1 °3 1 °3 ! ‘-’41 °4 > J 1 ) 

(SIS 1 ,) ~ (RlR 1 ) , (Si Si) ~ (Rl Rl) , 
(Si Sf) ~ (Rl Rl) , (Si, St) ~ (i?|, R\) . 


(14) 

(15) 
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In other words, the bunches of the system are distributed as the correspond¬ 
ing marginals of the coupling. A system has generally an infinity of couplings. 

The notion of a coupling is not confined to cyclic-4 systems. It applies to any 
system of random variables, the idea being that (a) the coupling is a set of jointly 
distributed random variables in a one-to-one correspondence with the variables 
constituting the system being coupled; and (b) the observable parts of this 
system are distributed in the same way as the corresponding marginals (subsets, 
or subcouplings ) of the coupling. In particular, the system being coupled can be 
a connection of the cyclic-4 system. 

Recall that the connection for property q is the set of all random variables 
measuring q in different contexts. In the cyclic-4 system the connection for 
property 1 is {!?}, I?f}, for property 2 it is {R^, R%}, etc., along the rows of the 
matrix 0. Each of these connections taken in isolation has its couplings. 

Definition 6.2. A pair of jointly distributed random variables j is a 

coupling of a connection l R q ,R q > in a cyclic-4 system if 


Tq ~ R c q , T c q ~ R q . (16) 

The coupling | T q , T q | is called maximal if the probability with which T ql T q 
attain equal values, Pr | T q =T q , is maximal among all couplings of | R ql R q j. 

Another way of stating the second part of the definition is that Pr | T q = T q J 

is as large as it is allowed to be by the distributions of T q and T q , which are 
fixed by (fl 6 l) . The following theorem says that this concept is well-defined. 

Theorem 6.3 (Refs. [11 [5]). A maximal coupling ^T q ,T q | of a connection 

^Rq,Rq | TO a cyclic-4 system exists and its distribution is unique: it is defined 
by HE and 


(r C q T*) = 1 - | (Tq) - ( t /) | = 1 - | < R C q ) - (<) | , ( 17 ) 


or equivalently, 


Pr 


T„ c = Tf' 

ii 

q q 



Pr [R c q = 1] - Pr 


Rq =1 


(18) 


The notion of a maximal coupling and the existence part of the theorem 
above can be generalized to arbitrary systems [HElEIj, but in this paper we 
focus on the cyclic-4 systems only. 

It is easy to see that if a cyclic-4 system is consistently connected, i.e., if 
(Rq) = ^ Rq ^ for all q,c,c', then in a maximal coupling | Tf , Tf | of any 

connection <R q ,Rq > we have ( T q T q ) = 1 , or equivalently, 


Pr 



= 1 . 


(19) 
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In other words, in a maximal coupling the measurements in a connection are 
modeled as being essentially “the same”. This simple observation allows us to 
make use of the notion of maximal couplings in the following rigorous version 
of the traditional understanding of contextuality. 

Definition 6.4. A consistently connected cyclic-4 system ffj is noncontextual 
if it has a coupling (fill) in which (Sf, Sf) , (S'!, Sf) , (Sf, Sf) , (Sf, Sf) are maxi¬ 
mal couplings for the corresponding connections (R(,7?f) , , (i? 3 ,i?g) , (Rl,Rf), 

i.e., if 


Pr 

[Si 1 = Sf] 

= 1, Pr 

s! = si 

Pr 

Q 2 c3 

°3 — °3. 

= 1, Pr 

sf = sf; 


If such a coupling does not exist, the system is contextual. 

This definition allows one to preserve the spirit of the traditional understand¬ 
ing (the context-irrelevance hypothesis: R c q and R q are always “the same”) while 
adhering to the logic of the CbD approach: R c q and R q are not only different, 
they are not even stochastically interrelated. From this point of view, the follow¬ 
ing theorem, first proved mutatis mutandis by Fine in Refs 1 22 112.31 . summarizes 
the traditional analysis of contextuality for the cyclic-4 systems. 

Theorem 6.5. A consistently connected cyclic-4 system © is noncontextual 
(by Definition |6'.^[ ) if and only if \11\) is satisfied. 

This is a special case of the theorem 17.21 below, which in turn is a special 
case of a theorem proved in Ref. [4] (see also Refs. mmmm) that applies to a 
broad class of cyclic systems, of which cyclic-4 ones are a special case. 

7 A general definition and criterion of contextu¬ 
ality in the CbD framework 

The fact that we relate Definition 16.41 to the notion of maximal couplings for 
connections reflects the intuition we are guided by and suggests a natural way 
of generalizing contextuality beyond consistently connected systems. 

The intuition in question can be explicated as follows. For an inconsistently 
connected system, we interpret the non-coincidence of the distributions of R q 
and R q as evidence that changes in context, c —»• c'. “directly” influence the mea¬ 
surement of q. For instance, in the Alice-Bob entanglement paradigm, if the two 
measurements are time-like separated, Alice’s choice of the spin axis can influ¬ 
ence Bob’s measurement along a given axis. This is referred to as “signaling”. 
It is also possible that a Charlie who receives information from both Alice and 
Bob and records both their settings and their measurement results makes sys¬ 
tematic errors in recording Bob’s results depending on Alice’s settings. This is 
referred to as “context-dependent biases”. Whatever the cause, when we model 
these “direct” influences by a coupling | T q , T q j of R q j, we translate the 
differences in distributions into differences in values: as c changes into d , the 
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value of changes into a corresponding value of . In a maximal coupling 
we do this in the maximally conservative way: the values of T ^ and remain 
the same as often as it is allowed by their individual distributions (in particular, 
they remain always the same if the distributions are the same). Modeling by 
such a coupling is always possible if | R c q ,R c q j is coupled in isolation. Now, if 
this is also possible for all the connections taken together, within the framework 
of an overall coupling of the entire system, we can say that direct influences are 
sufficient to account for the system. If, however, this is not possible, then the 
maximal couplings for different connections are not mutually compatible: we 
interpret this as evidence that we need more than direct influences to account 
for the system. This “more” is what we call contextuality, as distinct from direct 
influences. 

The generalization of Definition 16.41 to arbitrary systems of measurement 
therefore is straightforward: one can simply drop the qualification “consistently 
connected” and use the general form of Theorem 16.31 

Definition 7.1. A cyclic-4 system £4]) is noncontextual if it has a coupling (1141) 
in which , (S^S'l) , (S'!, S'!) , (S'!, S'!) are maximal couplings for the 

corresponding connections , (R^R^) , (R\,Rf), i.e., if 


Pr 

si = sf] 

= i - 

Pr 

R\ = 1 

-Pr 

R\ = 1 

Pr 

si = si 

= i - 

Pr 

R\ = i 

-Pr 

Rl = 1 

Pr 

si = si 

= i - 

Pr 

Rl = i 

-Pr 

Rl = i 

Pr 

'si = si 

= i - 

Pr 

Rl = 1 

-Pr 

R\ = 1 


This is arguably the most conservative generalization of Definition 16.41 but 
it is sufficient to deal with all conceivable cyclic-4 systems. The correspondingly 
generalized version of Theorem 16.51 is as follows [I]|M5J[I2] • 

Theorem 7.2. A cyclic-4 system ^ is noncontextual if and only if 

CHSH - ICC < 2, (22) 

where 

CHSH = max (± (RlRl) ± {R 2 2 Rl) ± ( R 3 3 Rl) ± (R^Rf)) , (23) 

odd number 
of -’s 


and 

ICC = | (R{) - + |{^> - {Rl )| + |<iZ§> - (Rl)\ + |(i?I) - (^)| ■ (24) 

The abbreviations in this theorem are as follows. CHSH is the left-hand side 
expression in the classical CHSH inequality ED. named so after the authors of 
Ref. [20| . ICC is a measure of inconsistency of the connectedness [T[[5lil2|: if it 
is zero, then the criterion (122D reduces to the CHSH inequality (fTTD . and the 
theorem above reduces to Theorem 16.51 
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Table 3: An inconsistently connected cyclic-4 system. In accordance with the 
general Definition 17.11 the system is contextual if and only if p is not 0 or 
1. In accordance with the narrow (traditional) Definition 16.41 the notion of 
contextuality is not applicable unless p = i/ 2 ; in all other cases the criterion 
ED is not derivable and the value of CHSH is not interpretable. 

R\ Rj 



+1 

-1 


+1 

i/2 

0 

i/2 

-1 

0 

i/2 

i/2 


i/2 

i/2 




+ 1 

-1 


+ 1 

i/2 

0 

i/2 

-1 

0 

i/2 

i/2 


i/2 

i/2 



R\ 





+1 

-1 




+ 1 

i/2 

0 

i/2 



-1 

0 

i/2 

i/2 





i/2 

i/2 





+ 1 

-1 



+ 1 

0 

P 

P 


-1 

1 -p 

0 

1-p 


1-p 

P 



R\ 


R 


4 

1 




To illustrate the computations, consider the modification of the example of 
Table [5] in Table [3] The value of CHSH in the system is 4, the same maximal 
possible value as in Table [2] But 

ICC = 1(7?!) - <7?i)| + \(R%) - (Ri )| = |4 p - 2|, 


whence 


CHSH — ICC = 4 — |4p — 2| 


4 (1 — p) + 2 if p > i/2, 
4p + 2 if p < 1 / 2 . 


The system is noncontextual by the criterion (B^l) only if p = 0 or p = 1; for 
other values the difference exceeds 2. 

One can see in (1^1) an algebraic realization of the intuition described above, 
of direct influences being or not being sufficient to account for the system. The 
direct influences are represented by the term ICC while CHSH — 2 can be viewed 
as the total of the dependence of measurements on contexts. If ICC is not large 
enough, it does not exceed CHSH — 2, and in this sense it is “insufficient” to 
explain the total of the context-dependence. The difference is the “unexplained” 
context-dependence that we view as true contextuality. 

For the arguments in favor of generalizing the definition of contextuality to 
inconsistently connected systems, see Refs. \mm- Let us emphasize here a 
pragmatic argument. Since one cannot prove a null hypothesis, dealing with ex¬ 
perimental results one can never be certain that consistent connectedness holds. 
If one confines one’s definition of contextuality to the latter case (Definition |H3D, 
one’s determination that a system is contextual would always be “suspended” 
and could be easily invalidated if with a larger sample size a small inconsistency 
were detected. Moreover, small inconsistencies should be expected in virtually 
all real experiments, as one can never be rid of all systematic sources of error 
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or make them perfectly counterbalanced. None of this poses a problem for Def¬ 


inition [7J] small values of ICC, unless CHSH is very close to 2, will not change 
one’s determination that a system is or is not contextual. 


8 The “Alice-Bob” EPR-B version of the cyclic-4 
system 


The contextuality analysis of a cyclic-4 system does not depend on what pre¬ 
cisely the properties { 91 , 92 , 93 , 94 } are, nor on what the contexts { 01 , 02 , 03 , 04 } 
are. All that matters is that each context involves two properties measured 
“together”, no two contexts share more than one property, each property is mea¬ 
sured in precisely two different contexts, and each measurement has two possible 
values. 

The importance of the cyclic-4 systems, however, is primarily related to the 
entanglement paradigm in quantum mechanics: two particles created in a sin¬ 
gular state move away from each other, reaching simultaneously two observers, 
one of them Alice and another Bob; Alice chooses one of two fixed axes and mea¬ 
sures her particle’s spin along it; Bob does the same with his particle. Assuming 
the two particles are spin- 1 ^ ones, the outcomes are binary random variables. 
Alice’s two fixed axes can be denoted ai = q\ and a 2 = 93 , and Bob’s axes can 
be denoted 61 = q 2 and 62 = 94 - The contexts then can be identified by the 
pairs of axes simultaneously chosen by Alice and Bob: 


ci = { 91 , 92 } = {ai,&i}, c 2 = { 92 , 93 } = {h,a 2 }, 
C 3 = {93,94} = {a 2 ,b 2 }, c 4 = { 94 , 91 } = {^ 2 ,ai}- 


(25) 


We can now simplify notation for the measurements by denoting Alice’s 
measurements by A and Bob’s by B. We will use two subscripts of which 
(note the asymmetry) the first one refers to Alice’s choice of one of her two 
axes, and the second one refers to Bob’s choice of one of his two axes. This 
notation ensures that Ay and Bij (and only these, identically subscripted pairs) 
are jointly distributed. Random variable Ay is interpreted as the outcome of 
measuring property a,; in the context of being measured together with property 
bj (whether or not the distribution of Ay depends on j): B^ is the outcome of 
measuring property bj in the context of being measured together with property 
a-i (whether or not the distribution of B^ depends on i). 

The correspondence between the general R c q notation and the special Ay-By 
notation is as follows: 



(26) 


The entanglement paradigm serves as a template for other applications, with 
very different meanings of the properties a, b (see Ref. (3j for examples in psy- 
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Table 4: The properties and their measurements in the EPR-B-like system used 
in Ref. [ 6 ] . In each trial a human respondent is asked to choose “wind directions” 
along a pair of spatial orientations in the Rose of the Winds. Each pair consists 
of one cardinal orientation (ai or 02 ) and one intercardinal orientation (bi or 
62 ). For instance, the respondent can be given ( 02 , 61 ), in which case her pos¬ 
sible choices would be “East and Northeast”, “East and Southwest”, “West and 
Northeast”, and “West and Southwest”, corresponding to four possible values of 
(-A2I) I?2l)- 


Properties, q 
(Rose of the Winds) 

Measurements 
(Choices of direction) 

Context 

(Options for other direction) 

North-South (ai) 

A\j = +1 = North 

Aij = —1 = South 

measured together with bj 

O'= 1.2) 

East-West (a2) 

A“2j — -|-1 — East 

A2j = —1 = West 

Northeast-Southwest (61) 

Bn = +1 = Northeast 

Bn = —1 = Southwest 

measured together with ai 
(i = 1,2) 

Northwest-Southeast (62) 

Bi2 = +1 = Southeast 

Bi2 = —1 = Northwest 


chology). In this paper we will use as an example the experiment by Aerts and 
Sozzo [B], where a and b are cardinal and intercardinal orientations chosen in the 
Rose of the Winds, and the measurements are choices (by human respondents) 
of one of two possible wind directions along each of these orientations, as shown 
in Table HI 

The results of this experiment (see Table[5]) yield the following computations: 

CHSH = 2.47, ICC = 0.71, 


whence 


CHSH - ICC = 1.76 < 2. 


We conclude that the data exhibit no contextuality in the sense of Definition 

o 


9 Methodological Remarks 

One may, of course, reject the generalized Definition [7J] and stick with the tradi¬ 
tional understanding (Definition 16.411 . but the latter applies only to consistently 
connected systems of measurements, whereas the inconsistency of the connect¬ 
edness in the data of Ref. [B] is clearly present in spite of the small sample size 
used (p < 0.03 for the difference between (B n) and (U 21 )). As explained in 
Section [5l in this case no inequality can be derived for CHSH (except for the 
trivial CHSH < 4), and no interpretation is known for whether CHSH exceeds 
or does not exceed any value below 4. 

The authors of Ref. [BJ are aware of the difficulties caused by inconsistent 
connectedness in judging violations of the CHSH inequality (see Ref. 0 ) , SO 
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Table 5: The results of the experiment reported in Ref. [B], All probability 
estimates are computed from polling 85 people, treating them as 85 realizations 
of one and the same pair of random variables in each of the four contexts. 

Bn B12 



+1 

-1 


+1 

0.13 

0.55 

0.68 

-1 

0.25 

0.07 

0.32 


0.38 

0.62 


+ 1 

-1 

+1 

0.13 

0.38 

0.51 

-1 

0.42 

0.07 

0.49 


0.55 

0.45 



B21 



+ 1 

-1 


+1 

0.47 

0.12 

0.59 

-1 

0.06 

0.35 

0.41 


0.53 

0.47 


+ 1 

-1 

+1 

0.09 

0.44 

0.53 

-1 

0.38 

0.09 

0.47 


0.47 

0.53 



B22 


they propose a computational modification of their data that makes all marginal 
distributions uniform. They justify this procedure by an isotropy argument, ac¬ 
cording to which any direction in the Rose of the Winds plane could be taken 
to play the role of the vector North, with all other directions rotated to pre¬ 
serve their angles with respect to this new North. Using this argument, Aerts 
and Sozzo average the observed probabilities in such a way that all marginal 
probabilities become 1/2 while the value of CHSH does not change. 

An isotropy argument, however, as any other symmetry argument, only 
makes sense if formulated as invariance of a relevant feature (in our case, mea¬ 
surement) with respect to certain changes. To give a trivial example, the length 
of a segment in the Euclidean plane is invariant with respect to its rotations. 
Therefore one can average the length measurements of a radius at different ori¬ 
entations, and this averaging would only improve statistical reliability of the 
measurements rather than change the true measured value. By contrast, we see 
in Table [5] that the measurements A,; ;i and are not invariant with respect to 
rotations: e.g., Pr [An = —1, B n = 1] is different from Pr [B 21 = —1, A 21 = 1], 
although the ordered pair of the orientations in the second case, (^ 1 ,( 12 ), is a 
rotated by 7 t/4 copy of the orientation pair in the first case, (a\,bi). Even 
more obvious: Pr [An = 1, B n = 1] is not the same as Pr [An = —1, B n = —1] 
although they pertain to orientation pairs rotated by ir with respect to each 
other. 

The latter example is important for the computational modification of the 
data used in Ref. [5] . This procedure achieves uniform marginals while retaining 
the value of CHSH precisely because it considers the jointly-opposite outcomes 

(Ajj x, Bij — y) and (A^j — x , B^j — y) , (27) 

with x,y £ {—1,+1}, to be “equivalent”. The probability of each of them is 
therefore replaced with their average: 

Pr [Ajj = x , B^ = y\ + Pr [A^ = -x, Bjj = -y\ 


2 


( 28 ) 
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Let us denote by A*j and B'*- the new random variables with these symmetrized 
distributions. Due to the symmetry, 

<^.)=Pr 

Pr 

At the same time, 

( A 

and since it follows from (l28l) that 


A ij — +1 

B h = + 1 


-Pr 

-Pr 


B* ' 
v / 


A h = -i] = o, 

Btj = -1] = 0. 

(29) 


(30) 

ij = Bij] , 

(31) 


L « y J 
the value of CHSH remains intact. 

This averaging procedure has been described in the quantum physics liter¬ 
ature by Masanes, Acin, and Gisin [21] ; it is the first part of their “depolariza¬ 
tion” procedure. There, however, it is meant to be a data generation or data 
doctoring procedure (not a data analysis one), involving either direct signaling 
between Alice or Bob, or a third party, Charley, who receives from Alice and 
from Bob their settings and their measurement results, flips a fair coin, and 
multiplies these measurement results (always both of them) by +1 or —1 ac¬ 
cordingly. Since this averaging procedure is universal (applicable to all EPR-B 
systems without exception), if taken as a data analysis procedure it amounts to 
ignoring the marginal probabilities altogether and simply defining contextuality 
(or entanglement) as any violation of the CHSH inequality. 

One might ask: why not adopt this approach? It is definitely simpler than 
the approach advocated by us, which involves (a) labeling the measurements 
contextually, (b) determining subsystems of measurements that are stochasti¬ 
cally unrelated to each other, (c) defining contextuality in terms of the (non)existence 
of a coupling for these subsystems with certain constraints imposed on the con¬ 
nections (measurements of the same properties in different contexts); and (d) de¬ 
riving CHSH inequalities or their generalizations as theorems nmMnHnHH]. 

The answer to the question is that adopting the definition in question, in 
addition to being arbitrary, would make construction of contextual systems 
child’s play: the contextual system will become ubiquitous and obvious, includ¬ 
ing systems in classical mechanics and human behavior that no one normally 
would think of as contextual. Moreover, with the definition in question one 
would have to forget about the “quantum” motivation for seeking contextuality, 
because these contextual systems in classical mechanics and human behavior 
would violate Tsirelson (or Cirel’son) bounds 1151211 as easily as they would the 
CHSH ones. 


10 Contextuality as child’s play 

We will consider just one example, with multiple possible implementations. Ta¬ 
ble [G] represents the probabilities of [Ay = cn, B tJ = y] in a hypothetical EPR- 
B-type system (i,j £ {1,2}, x, y £ {—1,+1}). Here, 

CHSH =4, 
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Table 6: A deterministic system that is (as any other deterministic system) 
noncontextual by Definition 17.11 but is “maximally contextual” if one ignores 
marginal probabilities (or, equivalently, averages over jointly-opposite out¬ 
comes) . 

.Bn B12 



+1 

-1 


+1 

1 

0 

1 

-1 

0 

0 

0 


1 

0 


+1 

-1 

+1 

1 

0 

1 

-1 

0 

0 

0 


1 

0 



B21 



+1 

-1 


+1 

1 

0 

1 

-1 

0 

0 

0 


1 

0 


+1 

-1 

+1 

0 

0 

0 

-1 

1 

0 

1 


1 

0 



B22 


A 12 


A22 


the algebraically maximal possible value for CHSH. The system is, however, 
noncontextual by Definition 17.11 and Theorem 17.21 ICC in it equals the value of 
(A 2 i) - {A 22 ) = 2, whence 

CHSH — ICC = 2 

In fact, any deterministic system (one in which all probabilities are 0 or 1) is 
noncontextual. A simple way of demonstrating this is as follows: a deterministic 
system has a single coupling, and its subcouplings corresponding to connections 
(each of which is deterministic) are their only couplings, hence maximal ones. 

However, if in Table [6] one decides to ignore marginal probabilities, the sys¬ 
tem is maximally contextual (and in fact more contextual than allowed by quan¬ 
tum mechanics). 

It is trivial to find or construct a system described by Table [6] To begin with 
conceptual combinations, consider, e.g., the experiment in which the properties 
a, b and measurements A , B are identified as shown in Table [7] Such an exper¬ 
iment would yield Table [5] unless the participants choose to deliberately give 
wrong responses. There is, in fact, nothing wrong in considering the conceptual 
inferences like “Green Triangle is Green” and “Green Triangle is Triangular” as 
examples of contextuality or “(super-quantum) entanglement,” but this looks to 
us as making the concept of contextuality too trivial to be of interest. 

Another example of “conceptual entanglement” involves creation of new con¬ 
cepts in children by means of teaching them a simple nonsense verse: 

Pips and Nips are Zops, not Zogs. 

Pops and Nops aren’t Zops nor Zogs. 

Pips and Nops are Gots, not Gons. 

Pops and Nips aren’t Gots nor Gons. 


Children who learned this piece of poetry by heart (or are allowed to look at 
it while responding) would confidently respond to the questions like “Are Pips 
Zops?” and “Are Pops Gots?” The resulting table of the probabilities for them 
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Table 7: A trivial system described by Table [ 6 ] if people are instructed to choose 
correct responses. In the context (ai, 6 i) the choice is among the sentences: 
“Green Triangle is Green”, “Green Triangle is Red”, “Yellow Circle is Green”, 
and “Yellow Circle is Red”. In the context ( 0 . 1 , 62 ) the choice is among the 
sentences: “Green Triangle is Triangular”, “Green Triangle is Square”, “Yellow 
Circle is Triangular”, and “Yellow Circle is Square”. In the context ( 02 , 61 ) the 
choice is among the sentences: “Green Circle is Green”, “Green Circle is Red”, 
“Yellow Triangle is Green”, and “Yellow Triangle is Red”. In the context ( 02 , 62 ), 
the choice is among the sentences: “Green Circle is Triangular”, “Green Circle 
is Square”, “Yellow Triangle is Triangular”, and “Yellow Triangle is Square”. 


Properties 
(choices between) 

Measurements 

(choices) 

Context 

Green Triangle and Yellow Circle (ai) 

A\j = +1 = Green Triangle 
A\j = — 1 = Yellow Circle 

together with bj 

(3 = 1 . 2 ) 

Green Circle and Yellow Triangle ( 0 , 2 ) 

A 2 j = +1 = Green Circle 

A 2 j = —1 = Yellow Triangle 

Green and Red ( 61 ) 

Bn = +1 = Green 

Bn = —1 = Red 

together with ai 

(3 = 1 , 2 ) 

Triangular and Square ( 62 ) 

Bi 2 = +1 = Triangular 

Bi 2 = — 1 = Square 


will be the same as in Table | 6 l on denoting the conditions and outcomes as in 
Table[ 8 l 

Finally, here is a scenario of creating Tabled] in a purely classical physical 
situation. There is a gadget “Alice” that responds to inputs i,j € {1,2} by 
computing A = min ( 1 , 2i + 2j — 5), and a gadget “Bob” that outputs 1 no 
matter what. This example is essentially identical to one given by Filk in Figure 
3 of Ref. (27]. No physicist, as it seems to us, would call the system consisting 
of these two gadgets entangled or contextual. It is simply that both inputs 
influence one of the outputs (Alice’s), resulting in the observed inconsistent 
connectedness. 


Table 8 : A trivial system described by Table[0]if people are instructed to choose 
correct responses given by the nonsense verse about Pips, Nips, etc. 


Properties 
(choices between) 

Measurements 

(choices) 

Context 

Pip and Pop (ai) 

A\j = +1 = Pip 

Aij = —1 = Pop 

together with bj 

(3 = 1,2) 

Nip and Nop (a.2) 

A 2 j = +1 = Nip 
A 2 j = — 1 = Nop 

Zop and Zog (61) 

Bi 1 = +1 = Zop 

Bi 1 = -1 = Zog 

together with ai 

0 = 1 , 2 ) 

Got and Gon (62) 

Bi2 — +1 — Got 

Bi2 = —1 = Gon 
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11 Conclusion 

Inconsistent connectedness is almost a universal rule in behavioral and social 
data (e.g., it is very plausible that the task of choosing between the North and 
South winds affects the probabilities with which one, in the same trial, chooses 
between the Northeast and Southwest winds). It is therefore a sound scientific 
strategy to make inconsistent connectedness part of one’s theory of contextu- 
ality. Inconsistent connectedness means that the measurement of a property 
is directly influenced by the measurement of other properties, and this may or 
may not be sufficient to account for a system’s behavior. For instance, in the 
experiment described in Ref. [28] we find violations of consistent connectedness 
due to context-dependent biases in measurements, but the detailed analysis pre¬ 
sented in Ref. [5] shows that contextuality, in the sense of Definition 17.11 is still 
prominently present in these data. By contrast, the system in Table [6] is non- 
contextual by Definition 17.11 which means that the direct influences it entails 
are sufficient to explain its behavior (no contextuality exists “on top of” these 
input-output relations). The same conclusion applies to the many different ex¬ 
periments analyzed in Ref. [3] and to the experiment reported in Ref. [B]. We 
have argued that the justification proposed in the latter for averaging across 
different contexts is not tenable, and the reason it works as desired is that it 
is equivalent to ignoring marginal probabilities altogether. The consequence of 
such ignoring, in addition to being ad hoc, is that contextuality becomes trivial 
and uninteresting. 

We make no claim, however, that contextuality, in the sense of our definition, 
cannot be found in behavioral data: we merely say that we have not found it 
yet. We also acknowledge that there may be viable alternatives to our Definition 
O that also take into account inconsistent connectedness in a different way. 

Finally, we would like to refer the reader to the concluding part of Ref. [3] 
to emphasize that absence of contextuality in behavioral and social systems 
does not mean that quantum formalisms are not applicable to them. The so- 
called QQ equality, in our opinion the most impressive outcome of quantum 
cognition research to date [29ll30| . provides a clear illustration of how absence 
of contextuality can in fact be precisely a prediction derived from quantum 
theory. 
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